Symmetry in a Perturbed Optical System 
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This paper investigates the effects that perturbations to an optical system, such as translations 
or rotations of the optical elements, have on the final location where a light ray strikes a detector. 
Symmetry arguments are employed to give selection rules for the combinations of those perturbations 
that can affect the light ray. It is shown that a "Transverse Parity" can be assigned to each type 
of perturbation and that only combinations with an overall odd parity will produce an effect. Some 
examples of how these results can be applied to undergraduate classroom or laboratory courses are 
given. 



I. INTRODUCTION 

The study of physics is replete with the use of symmetry 
to explain physical phenomena. The conservation of energy, 
momentum, and electric charge stem from symmetries as 
do the selection rules for allowed quantum-mechanical in- 
teractions or multipole moments of a force field or mass 
distribution. Indeed, in the undergraduate curriculum one 
finds symmetries and their application in nuclear and par- 
ticle physics, electromagnetism, quantum mechanics, and 
statistical mechanics to name a few. It comes as no sur- 
prise that symmetries play a role in optics as well. Indeed, 
once one moves beyond the nominal placement of the ele- 
ments of an optical system and studies, instead, the effects 
of perturbations to such systems, the effects of symmetry 
in optics become more subtle and more rich. 

This work investigates some roles that symmetry plays 
in an optical system that has been perturbed from its nom- 
inal configuration. This investigation reveals straightfor- 
ward relations between the perturbations (translations or 
rotations) of the optical elements that can combine to pro- 
duce a displacement of the beam spot or a rotation of the 
incident angle of the beam as it impinges upon a screen or 
detector. A straightforwared symmetry argument gives the 
selection rules for the allowed couplings among the pertur- 
bations. 

The advantage of this development for the undergraduate 
is that, unlike most treatments of symmetry (e.g. quantum 
mechanics), an optical system is easily manipulated and the 
consequence of changes to the system are immediately ap- 
parent. The relative transparancy of the effects of changes 
to an optical system allows the student to focus on under- 
standing the underlying physical principles instead of the 
proper use of a given experimental apparatus. With a few 
basic optical elements and a well designed set of exercizes, 
a physics student can study one of the most important as- 
pects of modern physical theory and appreciate the power 
of using symmetries to describe physical scenarios. The 
materials presented here are the aspects of a more general 
study that are readily incorporated into a standard under- 
graduate lecture or laboratory course. 

The original motivation for this work was to examine the 
systematic effects that are introduced into an experiment 
due to imperfections in the placement and alignment of the 
optical elements in several, nominally-identical optical sub- 
systems. In particular, an array of eight optical systems 



was used to determine the parameters of motion of a tor- 
sion pendulum undergoing large-amplitude torsional oscil- 
lations. Since the experiment required nanosecond timing 
precision, precise placement of the optical elements was im- 
portant as well as understanding the leading systematic ef- 
fects. The results of that investigation are the basis for this 
work. 

This paper will proceed by first introducing the approach 
that was used for developing the optical system model along 
with the assumptions that are made about the optical el- 
ements themselves. Some short examples are provided to 
show the nature of the model and the results it bears. Next, 
selection rules for the allowed couplings of optical element 
perturbations are demonstrated and derived via the intro- 
duction of a "Transverse Parity" that is assigned to the 
various ways that an element can be perturbed. Finally, 
attention is given to the application of this work to the 
undergraduate classroom or laboratory. 



II. MODEL OF THE OPTICAL SYSTEM 

For this study, the optical system is modeled using a quasi 
object-oriented approach where the position and orientation 
information of a given lens, mirror, ray, etc. is encapsulated 
in a lens, mirror, or ray object. For example, a ray object 
consists of a point of origin and a direction vector, a lens 
object consists of a focal length, a center point, and a nor- 
mal vector. The effect of a ray refracting through a lens is 
found using a function that takes a given ray and lens, then 
returns the refracted ray using analytic geometry. 

This method is different from the standard geometrical 
or two-dimensional matrix formulation that is presented 
in most undergraduate optics textsi^ and from more gen- 
eral three dimensional matrix approaches 3 . This approach 
was chosen because it facilitates the three-dimensional ma- 
nipulation of the optical elements and allows for a more 
straightforward interpretation of the analytic results that 
it produces. Most of the computation was done with the 
symbol-manipulating software Mathematica. 

The following assumptions are made throughout this 
work: all of the optical elements are perfect, thin, are suf- 
ficiently large that displacements and rotations of the ele- 
ments still allow the beam to impinge upon them, and that 
refraction is not wavelength dependent. It is also conve- 
nient to assume that the deviations of the optical elements 
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from their nominal position are small so that they can be 
treated perturbativelyA That is, rotations are small and 
displacements are small compared to the appropriate focal 
lengths. Finally, only a single, central ray of the beam is 
considered in this investigation. Some comments concern- 
ing an extended beam are made near the end of the paper. 

A. Example: Linear Displacement I 

The first example to consider is that of a single lens and 
detector nominally placed along the longitudinal (or "op- 
tic") axis. If the lens is displaced vertically a distance dy, as 
shown in figure ^ then the location where the beam strikes 
the detector is also displaced vertically by an amount 



where / is the focal length of the lens, si is the longitudinal 
position of the lens, and Sd is the longitudinal position of the 
detector. Equation shows that the vertical displacement 
of the beam spot is linear in the displacement of the lens and 
also linear in the separation of the lens and detector — the 
"lever arm". 
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FIG. 1: A light ray refracting through a lens of focal length / 
which is displaced vertically by an amount dy from its nominal 
position. This perturbation causes the final location of the beam 
to be displaced vertically by an amount h. 



B. Example: Linear Displacement II 



where s±, S2, and Sd are the positions of the first lens, the 
second lens, and the detector respectively and /i and / 2 are 
the focal lengths of the lenses. 
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FIG. 2: A light ray refracting through two lenses. One, with 
focal length /i is displaced vertically by an amount dyi. The 
other, with focal length fa is displaced by an amount dyi. The 
vertical displacement of the beam at the screen is given by h. 



This example shows that the displacement of the beam 
spot grows linearly with the vertical displacements of the 
lenses, as expected. It also shows that there are two com- 
peting effects that depend upon the separation of the optical 
elements. For small separations of the optical elements the 
spot height grows linearly with the nominal placements of 
the lenses. For large separations of the elements, the height 
falls quadratically. 

A more interesting feature of equation J5J) is that there is 
no coupling between the displacements dyi and dyi- Gener- 
ally, one might expect that cross terms of the form dy\dyi 
might exist, yet they do not. If a third lens is included, 
then there are three terms in the resulting height of the 
beam spot. Each term is linear in the displacement of a 
particular lens, and again, no cross terms exist. This fact 
holds true for any displacements that lie in the plane that 
is transverse to the incoming beam. This trend, however, 
does not apply for longitudinal displacements. It can be 
seen from equation that a small displacement of the lens 
in the longitudinal direction (s; — * si+dz) would contribute 
a quadratic term of the form dydz/ f. 



Now, suppose that two lenses are used in the optical sys- 
tem, again aligned nominally along the longitudinal axis. 
If the two lenses are displaced vertically by amounts dyi 
and dyi , as shown in figure |3 then the resulting vertical 
displacement of the beam spot is given by 




C. Example: Rotation 

In the model used for this paper, each rotation of an opti- 
cal element is about a body centered axis that remains par- 
allel to the corresponding lab- frame axis. Thus, the origin 
of the rotation axes move with the lens, but the orientation 
of the axes do not. To see the effects of rotation, consider 
the case of a single lens displaced vertically a distance dy 
and rotated about the x axis by an angle da as shown in 
figure|3J This example, to third order in the perturbations, 
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gives a vertical displacement of the beam on the detector 
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(3) 



dyda 



Unlike the previous examples where all the perturbations 
were displacements, here there are cross terms between the 
transverse displacements and the rotations of the lens about 
the transverse axes. Regardless of the number of elements 
in the optical system, each term in equations like @ will 
contain an odd number of transverse perturbative factors 
(i.e. factors that correspond to perturbations in the trans- 
verse directions). Similar to the previous examples, if there 
is a rotation about the longitudinal axis, then some cou- 
plings will contain an even number of perturbative factors. 
However, the terms that involve longitudinal perturbations 
still have an odd number of transverse perturbations. The 
selection rules for terms that ultimately contribute to the 
displacement of the beam spot are addressed in the next 
section. 




FIG. 3: A light ray is refracted by a lens of focal length / which 
is displaced vertically a distance dy and rotated by an angle da 
about the x-axis. The resulting displacement of the beam on 
the detector is given by h. 



III. TRANSVERSE PARITY 

The examples presented in the last section show that not 
all possible combinations of displacements and rotations of 
the optical elements in a system will change the final posi- 
tion of the beam spot. The derivation of the selection rules 
for the allowed terms is relatively straightforward. The dis- 
placement of the beam spot in the detector (x, y) can be 
expressed to arbitrary order as a polynomial function of the 



displacements and rotations of the set of optical elements 
in the system 

(x, y) = f({dx}, {dy}, {dz}, {da}, {dp}, {dj}) (4) 

where a, [3, and 7 are rotations about the x, y, and z 
axes respectively and the curly braces represent the set 
of all perturbations of a given kind (i.e. {dx} represents 
the x displacements of the set of optical elements). If 
the entire system is reflected through the nominal axis of 
symmetry (or rotated about the axis of symmetry by tt) 
then the beam spot must also reflect in a similar manner, 
(x,y) — ► (— x, — y). Therefore, each term in the polyno- 
mial function must have odd symmetry in the plane 
transverse to the axis of symmetry. 

From this discussion, one can define a preliminary, multi- 
plicative, transverse parity operator V such that transverse 
perturbations, whether displacements or rotations, change 
sign while the signs of longitudinal displacements or rota- 
tions remain unchanged 

V(x, y, z, a, (3, 7) — > (-x, -y, z, -a, -(3, 7). (5) 

Every possible perturbation to an optical element, whether 
a lens, detector, or mirror, etc. can be designated as having 
either even or odd transverse parity, though for a general 
optical element the transverse perturbations need not have 
odd transverse parity (hence "preliminary"). The expres- 
sion for the final location of the beam spot can only be 
expressed using terms that have overall odd parity. 

From the example of two lenses given earlier, each of the 
displacements of the lenses has odd transverse parity and 
the result @ has two terms, both linear in the transverse 
displacements and thus both with odd transverse parity. A 
longitudinal displacement of a lens has even parity. Such 
a displacement would allow a coupling of the form dydz to 
contribute to the location of the beam spot since the overal 
parity of such a term is still odd 



V(dydz) — {—dy){dz) = —dydz 



(6) 



Each term in the rotation example also has odd parity. The 
third term in equation (j3J, for example, is proportional to 
dyda 2 which has parity 



V{dyda 2 ) = (-dy)(-daf 



-dyda 



(7) 



One application of transverse parity is to shorten the cal- 
culation (to a given order) of the final location of the beam 
spot in a large optical system. Instead of allowing the com- 
puter to calculate all the terms in the expansion, thereby 
spending a lot of time on terms that are ruled out because 
of incorrect parity, one can reject such terms immediately 
and focus on those that are allowed to contribute. 



A. Other Symmetries 

Other symmetries that are inherant in a particular op- 
tical element can further restrict the allowed terms in Q 
and may affect the parity of a particular perturbation. For 
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example, a mirror has translational symmetry in the trans- 
verse plane and a cylindrical lens has translational symme- 
try along its axis of cylindrical symmetry. A translation 
of a cylindrical lens along that axis or a rotation about 
that axis would produce no effect on the incident beam and 
would therefore have even transverse parity. 

Symmetry arguments such as this also demonstrate why 
even-parity combinations of otherwise odd-parity perturba- 
tions are not allowed. Consider the forbidden combination 
of dydfi for a standard lens, a vertical translation followed 
by a rotation about the vertical axis. Since the vertical 
translation maintains the left-right symmetry of a lens, no 
rotation about that same axis will produce a change in the 
location of the beam spot. 

The parity of a given perturbation to a given optical ele- 
ment is ultimately defined by how that perturbation affects 
either the ray of light that passes through the geometric 
center of the optical element for translations or how it af- 
fects a ray of light that strikes the element at a point that 
is off of the rotation axis of a rotation-type perturbation. 
A perturbation that produces a result with odd symmetry 
has odd transverse parity, and one that gives an even result 
would have even symmetry. These definitions of parity only 
slightly change the parity operation given in (J5J in that the 
odd parity perturbations will change sign and the even per- 
turbations will not, regardless of the axis involved in the 
perturbation. 

B. A Real Beam 

Transverse parity can also describe the behavior of a 
beam with physical extent. A ray that is displaced from 
the center of the beam would have an overall odd trans- 
verse parity. The same is true for a ray that is not paral- 
lel to the longitudinal axis. That transverse parity applies 
here is not surprising since, using a finite number of lenses 
and a central ray, one can reconstruct any other ray in the 
physical beam. The physical beam is essentially a superpo- 
sition of odd-parity, perturbed, central rays and each of the 
terms that are allowed in this construction must already 
have overall odd transverse parity. 



tions such as Q or Q using either a matrix optics approach 
or analytic geometry. Another possibility, directly applica- 
ble to laboratory research, is to determine the sources of 
systematic error in a given optical system and their relative 
importance. Such a study could include how those errors 
scale with the nominal placement of the optical elements 
(e.g. the separation of a light source and a collomating 
lens) or with perturbations to the elements. The introduc- 
tion of a cylindrical lens presents an opportunity to study 
the translational symmetry of the lens and also the effects 
that arise from the addition of the "preferred direction" 
along the axis of cylindrical symmetry. 

Beyond the direct application to optics, there are many 
aspects of this work that pertain to many branches of 
physics. Selection rules, parity, symmetry, dimensionless 
ratios, and perturbations are ubiquitous in physics and can 
all be studied with this model. A student could be asked to 
determine if a given coupling is allowed, to determine the 
coefficient that corresponds to that perturbation, or even to 
empirically find the analytic expression for that same coef- 
ficient by using the dimensionless ratios that can be formed 
by the separation of the lenses and their focal lengths. Sim- 
ilarly, a student could study how the coefficients scale with 
the nominal separation of the optical elements, such as the 
expression that multiplies the perturbation to the first lens 
in equation @ which scales linearly with short separations 
and quadratically with large separations. 

Using the perturbed optical system one can learn about 
many important tools that physicists use in their efforts 
to understand natural law. The apparatus, consisting of 
a beam of light and a few lenses, is relatively inexpensive 
and easy to manipulate. The effects of a perturbation to 
the system appear instantly on a screen or detector which 
transparency allows for a deeper investigation of the phys- 
ical principles being studied instead of simply a tutorial 
on the use of a particular piece of laboratory equipment. 
This system should prove useful as a pedagogical device for 
undergraduates, both as a theoretical tool and as an exper- 
imental investigation of a simple theory. 



IV. DISCUSSION 
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